NSF PAR Search | NSF Public Access Repository

Identification of a Novel Gene MtbZIP60 as a Negative Regulator of Leaf Senescence through Transcriptome Analysis in Medicago truncatula

https://doi.org/10.3390/ijms251910410

Xing, Jiayu; Wang, Jialan; Cao, Jianuo; Li, Ke; Meng, Xiao; Wen, Jiangqi; Mysore, Kirankumar S; Wang, Geng; Zhou, Chunjiang; Yin, Pengcheng (October 2024, International Journal of Molecular Sciences)

Leaves are the primary harvest portion in forage crops such as alfalfa (Medicago sativa). Delaying leaf senescence is an effective strategy to improve forage biomass production and quality. In this study, we employed transcriptome sequencing to analyze the transcriptional changes and identify key senescence-associated genes (SAGs) under age-dependent leaf senescence in Medicago truncatula, a legume forage model plant. By comparing expression data across time points, we identified 1057 differentially expressed genes, with 108 consistently up-regulated genes across leaf growth and senescence. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses showed that the 108 SAGs were mainly related to protein processing, nitrogen metabolism, amino acid metabolism, RNA degradation and plant hormone signal transduction. Among the 108 SAGs, seven transcription factors were identified, among which a novel bZIP transcription factor, MtbZIP60, was shown to inhibit leaf senescence. MtbZIP60 encodes a nuclear-localized protein and possesses transactivation activity. Further study demonstrated that MtbZIP60 could associate with MtWRKY40, both of which exhibited an up-regulated expression pattern during leaf senescence, indicating their crucial roles in the regulation of leaf senescence. Our findings help elucidate the molecular mechanisms of leaf senescence in M. truncatula and provide candidates for the genetic improvement of forage crops, with a focus on regulating leaf senescence.
Full Text Available
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data

https://doi.org/10.18653/v1/2022.acl-long.214

Zhou, Shuyan; Zhang, Li; Yang, Yue; Lyu, Qing; Yin, Pengcheng; Callison-Burch, Chris; Neubig, Graham (January 2022, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Procedures are inherently hierarchical. To “make videos”, one may need to “purchase a camera”, which in turn may require one to “set a budget”. While such hierarchical knowledge is critical for reasoning about complex procedures, most existing work has treated procedures as shallow structures without modeling the parent-child relation. In this work, we attempt to construct an open-domain hierarchical knowledge-base (KB) of procedures based on wikiHow, a website containing more than 110k instructional articles, each documenting the steps to carry out a complex procedure. To this end, we develop a simple and efficient method that links steps (e.g., “purchase a camera”) in an article to other articles with similar goals (e.g., “how to choose a camera”), recursively constructing the KB. Our method significantly outperforms several strong baselines according to automatic evaluation, human judgment, and application to downstream tasks such as instructional video retrieval.
Full Text Available
Learning to Superoptimize Real-World Programs

Shypula, A. G.; Yin, Pengcheng; Lacomis, Jeremy; Le Goues, Claire; Schwartz, Edward J.; Neubig, Graham (January 2022, Deep Learning for Code Workshop (ICLR 2022 Workshop))

Program optimization is the process of modifying software to execute more efficiently. Superoptimizers attempt to find the optimal program by employing significantly more expensive search and constraint solving techniques. Generally, these methods do not scale well to programs in real development scenarios, and as a result superoptimization has largely been confined to small-scale, domain-specific, and/or synthetic program benchmarks. In this paper, we propose a framework to learn to superoptimize real-world programs by using neural sequence-to-sequence models. We created a dataset consisting of over 25K real-world x86-64 assembly functions mined from open-source projects and propose an approach, Self Imitation Learning for Optimization (SILO), that is easy to implement and outperforms a standard policy gradient learning approach on our dataset. Our method, SILO, superoptimizes 5.9% of our test set when compared with the gcc version 10.3 compiler’s aggressive optimization level -O3. We also report that SILO’s rate of superoptimization on our test set is over five times that of a standard policy gradient approach and a model pre-trained on compiler optimization demonstration.
Full Text Available
DIRE and its Data: Neural Decompiled Variable Renamings with Respect to Software Class

https://doi.org/10.1145/3546946

Dramko, Luke; Lacomis, Jeremy; Yin, Pengcheng; Schwartz, Edward J.; Allamanis, Miltiadis; Neubig, Graham; Vasilescu, Bogdan; Le Goues, Claire (January 2022, ACM Transactions on Software Engineering and Methodology)

The decompiler is one of the most common tools for examining executable binaries without the corresponding source code. It transforms binaries into high-level code, reversing the compilation process. Unfortunately, decompiler output is far from readable because the decompilation process is often incomplete. State-of-the-art techniques use machine learning to predict missing information like variable names. While these approaches are often able to suggest good variable names in context, no existing work examines how the selection of training data influences these machine learning models. We investigate how data provenance and the quality of training data affect performance, and how well, if at all, trained models generalize across software domains. We focus on the variable renaming problem using one such machine learning model, DIRE. We first describe DIRE in detail and the accompanying technique used to generate training data from raw code. We also evaluate DIRE’s overall performance without respect to data quality. Next, we show how training on more popular, possibly higher quality code (measured using GitHub stars) leads to a more generalizable model because popular code tends to have more diverse variable names. Finally, we evaluate how well DIRE predicts domain-specific identifiers, propose a modification to incorporate domain information, and show that it can predict identifiers in domain-specific scenarios 23% more frequently than the original DIRE model.
Full Text Available
Learning Structural Edits via Incremental Tree Transformations

Yao, Ziyu; Xu, Frank; Yin, Pengcheng; Sun, Huan; Neubig, Graham (January 2021, The Ninth International Conference on Learning Representations 2021 (ICLR'21))
Full Text Available
Learning Structural Edits via Incremental Tree Transformations

Yao, Ziyu; Xu, Frank F.; Yin, Pengcheng; Sun, Huan; Neubig, Graham (January 2021, International Conference on Learning Representations)
Full Text Available
Incorporating External Knowledge through Pre-training for Natural Language to Code Generation

https://doi.org/10.18653/v1/2020.acl-main.538

Xu, Frank F.; Jiang, Zhengbao; Yin, Pengcheng; Vasilescu, Bogdan; Neubig, Graham (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

Full Text Available
Reranking for Neural Semantic Parsing

https://doi.org/10.18653/v1/P19-1447

Yin, Pengcheng; Neubig, Graham (July 2019, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics)

Full Text Available
DIRE: A Neural Approach to Decompiled Identifier Naming

Lacomis, Jeremy; Yin, Pengcheng; Schwartz, Edward J.; Allamanis, Miltiadis; Le Goues, Claire; Neubig, Graham; Vasilescu, Bogdan (November 2019, The 34th IEEE/ACM International Conference on Automated Software Engineering)

Full Text Available
TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation

https://doi.org/10.18653/v1/D18-2002

Yin, Pengcheng; Neubig, Graham (January 2018, Proceedings of the Conference on Empirical Methods in Natural Language Processing (Demo Track))

Full Text Available
